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(54) Title: CRYSTALLINE FGF9 DIMER AND METHODS OF USE 

(57) Abstract: The crystal structure of FGFV has been determined and is shown to exist in a tetragonal space group I4i with lattice 
constants a = 15 1 .9 A and c = 1 17.2 A. The crystal may be refined to an R value of R = 22.0 % at 2. 6 A resolution. The crystal may be 

Q used in drug screening assays. A three-dimensional model of FGF9 is also disclosed, as is a three-dimensional computer image of the 
three-dimensional structure of FGF9, computer-readable data storage medium encoded with computer-readable data corresponding 

^ to the three-dimensional computer image, as well as computers for producing such a three-dimensional representation. 
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CRYSTALLINE FGF9 DIMER AND METHODS OF USE 

FIELD OF THE INVENTION 

The present invention relates to certain crystallized fibroblast growth 
5 factor 9 (FGF9) dimers. This invention also relates to computational methods for 

using structure coordinates of the protein complex to screen for and design compounds 
that interact with FGF9 or homologs thereof and methods of using the crystal structure 
of FGF9 to design pharmaceuticals. 

BACKGROUND OF THE INVENTION 

1 0 Fibroblast growth factors (FGFs) constitute a family of at least twenty 

structurally related, heparin binding polypeptides which are expressed in a wide 
variety of cells and tissues. They stimulate the proliferation of cells from 
mesenchymal to epithelial and neuroectodermal origin. FGFs share structural 
similarity, but differ in their target specificity and spatial and temporal expression 

1 5 pattern. The biological response of cells to FGF is mediated through specific, high 
affinity (Kd 20-500 pM) cell surface receptors that possess intrinsic tyrosine kinase 
activity and are phosphorylated upon binding of FGF (Coughlin et al 1988). A lower 
affinity (Kd 2 x 10 9 M), large capacity (10 6 sites/cell), class of binding sites has also 
been identified as heparin sulfate moieties of proteoglycans. Heparin sulfates are 

2 0 ubiquitous polysaccharides, composed of repeating disaccharides of variably sulfated, 
either glucoronate or iduronate and glucosamine residues, arranged in distinct domains 
which greatly vary in length and sulfation levels. A unique role for these molecules is 
in the formation of distinct complexes, essential for high affinity binding and 

1 
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activation of FGF in particular and of other heparin-binding growth factors in general 
(Yayon et al, 1991 ; Rapraeger et al 1991). 

Ligand and receptor dimerization is a key event in the transmembrane 
signaling of receptor tyrosine kinases. Receptor dimerization leads to an increase in 
5 kinase activity, resulting in autophosphorylation and the induction of diverse 

biological responses (Schlessinger et al, 1992). Several models have been proposed 
for the interaction between FGF2 -heparin and its receptor (Y ayon et al, 1991 ; 
Ruoslahti 1991; Spivak-Kroizman et al; 1994, Kan et al; 1993, Guimond 1993; 
Pantoliano et al, 1 994). Previous work utilizing NMR demonstrated that FGF dimers 

10 in a symmetric tetramer are formed in the presence of an active heparin decasaccharide 
(Moy et al, 1997), suggesting that a cis-oriented dimer is the minimal, biologically 
active structural unit of FGF2. Using defined heparin fragments and soluble FGF 
receptors further demonstrated that ligand dimerization can significantly enhance 
binding of FGF2 to FGFR1, dimerization of the receptor and induction of downstream 

1 5 signal transduction pathways. More recently, several studies (Plotnikov et al, 1999; 
Stauber et al, 2000; Plotnikov et al, 2000) exploring the crystal structure of a complex 
between FGF2 and FGF1 with the extracellular domains of FGFR1 and FGFR2 have 
shown a 1 :2 molecular ratio of ligands to receptors with no evidence for ligand 
dimerization, the biological relevance of which has still to be determined. 

2 0 FGFs share in their primary sequence a homology core of around 120 

amino acids, including four cysteine residues, one of which is conserved in all 
members of the family. The core structure contains 12 antiparallel |3 strands, 
organized into a threefold internal symmetry. Equivalent folds have been observed for 
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the soybean trypsin inhibitor and interleukin LL-la and b. The best characterized 
members of the family are FGF1 (aFGF) and FGF2 (bFGF), the structures of which 
have been determined (Zhang et al, 1991; Zhu et al, 1991). Both are potent mitogens 
that stimulate proliferation, migration and differentiation of a large variety of cells 
5 (Folkman et al, 1987; Rifkin et al, 1989). 

FGF9, a recently identified member of the FGF family was originally 
discovered as a heparin binding glia activating factor (Miyamoto et al, 1993; U.S. 
patents 5,622,928 and 5,5 12,460). Human FGF9 codes for a 208 amino acid protein. 
It shares a 30% overall sequence identity with other FGFs but has a unique spectrum 

10 of target cell specificity as it stimulates the proliferation of glia and other fibroblast- 
like cells but is not mitogenic for endothelial cells (Naruo et al, 1993). The basis for 
such cell selectivity resides in its differential capacity to bind the different FGF 
receptors. Recombinant FGF9 binds with high affinity and in a heparin dependent 
manner to FGFR3, with somewhat less affinity to FGFR2 and with considerably less 

1 5 to FGFR1 (Hecht et al, 1 995). 

Mutations in FGFR3 have been shown to be responsible for 
achondroplasia, the most common form of genetic dwarfism. Examination of the 
sequence of FGFR3 in achondroplasia patients identified a mutation in the 
transmembrane domain of the receptor. 

20 As reported in WO 96/41 523, the entire contents of which are hereby 

incorporated herein by reference, FGF9 not only specifically binds to the FGFR3, but 
also specifically activates this receptor without activating the FGFR1 and FGFR4 
receptors and, if appropriate concentrations are chosen, without significantly activating 
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FGFR2. Thus, a pharmaceutical composition comprising a pharmaceutical^ 
acceptable carrier and, as an active ingredient, a therapeutically effective amount of 
FGF9, may be used for stimulating the activity of FGFR3. Similarly, if antagonists of 
FGF9 could be found, pharmaceutical compositions containing such antagonists could 
5 be used to attenuate the activity of FGFR3 . 

Normal cartilage and bone growth and repair of damage to the cartilage 
and bone requires a specific and delicate balance between up regulation and down 
regulation of the activity of the FGFR3. It has been theorized that active FGFR3 is 
necessary in the initial stages of cartilage-bone differentiation, and, after 

1 0 differentiation, is required for cartilage-bone repair. Thus, a pharmaceutical 

composition comprising as an active ingredient FGF9, which stimulates the activity of 
FGFR3, may be used in order to encourage cartilage and bone repair, for example by 
administration to the site of injury. Furthermore, FGFR3 exists usually temporarily on 
mesenchymal stem cells and usually disappears after differentiation. Administration 

15 of FGF9 may serve to stabilize FGFR3 and thus prolong the period in which it is 

active prior to differentiation. FGF9 has also a chemotactic affect of FGFR3 -carrying 
cells and can promote migration of such FGFR3 carrying cells, typically mesenchymal 
stem cells, to a desired site, for example, by injection of FGF9 to the growth plate top 
of the column. 

2 0 According to this theory, overactivation of FGFR3 after the stage of 

initial differentiation of bone and cartilage cells, leads to halted growth, and is 
probably the cause of achondroplasia. Thus, a pharmaceutical composition 
comprising as an active ingredient an antagonist of FGF9 which attenuates the activity 
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of FGFR3, or comprising an FGF9 binding agent (such as an antibody against FGF9), 
which neutralizes native circulating FGF9, should be used in cases of overactivity of 
the FGFR3 receptor in differentiated tissues, which causes bone and cartilage growth 
arrest Such bone and cartilage growth arrest may lead to achondroplasia dwarfism, or 
5 other abnormalities of bone and cartilage growth, for example, multiple hereditary 
exostosis, solitary hereditary exostosis, hallux valgus deformity, synovial 
chondromatosis and endochondromas. 

The above conditions may be treated with a pharmaceutical 
composition comprising either an antagonist of FGF9, or an FGF9 binding agent 

1 0 capable of neutralizing native circulating FGF9, which both serve to attenuate the 
activity of FGFR3. 

Thus, FGF9 agonists can be used for the purpose of repair and 
regeneration of defective articular cartilage, for treatment of achondroplastic patients, 
for treatment of patients suffering from other growth disturbances and for treatment of 

1 5 physical injuries with poor predicted rate of cartilage and bone growth. They may also 
be used as interventions for manipulating the rate of growth within growth plates in 
order to increase the growth rate and/or prevent premature differentiation; or may be 
used for direct injection into the nucleus pulposus of the fine vertebrae in order to 
enhance the healing of spine injuries. FGF9 antagonists can be used to suppress the 

2 0 activity of a wild type FGFR3 receptor, for example, in the cases of various types of 
tumors and the like. See WO 96/41620. 
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As there is a need for compounds that selectively inhibit FGFR3 or act 
as a selective agonist for FGFR3, it would be desirable to have improved methods that 
facilitate the design of such compounds. 

The concept of rational drug design involves obtaining the precise 
5 three-dimensional molecular structure of a specific protein to permit design of drugs 
that selectively interact with and adjust the function of that protein. Theoretically, if 
the structure of a protein having a specified function is known, the function of the 
protein can be adjusted as desired. This permits a number of diseases and symptoms 
to be controlled. For example, CAPTOPRIL is a well known drug for controlling 

1 0 hypertension that was developed through rational drug design techniques. 

CAPTOPRIL inhibits generation of the angiotension-converting enzyme, thereby 
preventing the constriction of blood vessels. The potential for controlling disease 
through drugs developed by rational drug design is tremendous. The power of rational 
drug design has been reviewed by Bugg et al (1993). 

15 A requirement of rational drug design is the production of crystals of 

the desired target protein which provide for the determination of the detailed atomic 
structure of both the parent protein and its complex with the pharmaceutical. For this 
purpose, knowledge of the three-dimensional structure coordinates of FGF9 would be 
useful. Such information would aid in identifying and designing potential inhibitors 

2 0 and agonists of FGFR3 that in turn are expected to have therapeutic utility. 
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SUMMARY OF THE INVENTION 

The present invention provides crystallized FGF9. The structure 
coordinates reveal that the crystalline FGF9 shows a symmetric dimer with unique 
receptor and heparin binding surfaces. FGF9 crystallized in the tetragonal space group 
5 I4j with lattice constants a=l 5 1 .9 A, c=l 1 7.2 A. The structure has been refined to an 
R-value of R=22.0% (R fre e = 25.2%) at 2.6 A resolution. The four molecules in the 
asymmetric unit are arranged in two non-crystallographic dimers with the dimer 
interface composed partly of residues from N- and C-terminal extensions from the 
FGF-core structure. Most of the receptor-binding residues identified in FGF1- and 

1 0 FGF2-receptor complexes are buried in the dimer interface with the (38/09 loop 
stabilized in a particular conformation by an intramolecular hydrogen bonding 
network. The potential heparin binding sites are in a pattern distinct from FGF1 and 
FGF2. The carbohydrate moiety attached at N79 has no structural influences. 

The use of the crystal structure to design candidate agonists and 

1 5 antagonists of the FGFR3 may be accomplished in the following fashion. Once the 
crystal structure of the target (i.e., FGF9) is determined, computer modeling is 
conducted (using such programs as DOCK (Kuntz et al, 1982) or Multiple Copy 
Simultaneous Search (MCSS)(Mirankev et al, 1991)) to construct candidate agonist or 
antagonist compounds based on the crystal structure. These compounds are 

2 0 chemically synthesized and their biological activity is assayed. Preferably, such 
agonists or antagonists are mutants or fragments of FGF9 itself. For example, a 
preferred antagonist would be a mutant of FGF9 designed by computer modeling 
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based on the crystal structure of FGF9, which mutant bonds to the FGFR3 receptor 
without activating it. 

Furthermore, once the three-dimensional structure of a crystal 
comprising the FGF9 protein is determined, a potential ligand (antagonist or agonist) 
5 is examined through the use of computer modeling using a docking program such as 
GRAM, DOCK, or AUTODOCK (Dunbrack et al, 1997). This procedure can include 
computer fitting of potential ligands to the FGF9 dimer to ascertain how well the 
shape and the chemical structure of the potential ligand will complement or interfere 
with the dimer-dimer interaction (Bugg et al, 1993; West et al 1995)). Computer 

1 0 programs can also be employed to estimate the attraction, repulsion, and stearic 

hindrance of the ligand to the dimer-dimer binding site. Generally, the tighter the fit 
(e.g., the lower the stearic hindrance, and/or the greater the attractive force), the more 
potent the potential drug will be since these properties are consistent with a tighter 
binding constant. Furthermore, the more specificity in the design of a potential drug, 

15 the more likely that the drug will not interfere with other properties of the FGF9 

protein. This will minimize potential side effects due to unwanted interactions with 

* 

other proteins. 

Initially a potential ligand could be obtained by screening a random 
peptide library produced by recombinant bacteriophage for example, (Scott et al, 1990; 
2 0 Cwirla et al, 1990; Devlin et al, 1 990) or a chemical library. A ligand selected in this 
manner could then be systematically modified by computer modeling programs until 
one or more promising potential ligands are identified. Such analysis has been shown 
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to be effective in the development of HIV protease inhibitors (Lam et al, 1994; 
Wlodawer et al, 1993; Appelt, 1993; Erickson, 1993). 

Such computer modeling allows the selection of a finite number of 
rational chemical modifications, as opposed to the countless number of essentially 
5 random chemical modifications that could be made, and of which any one might lead 
to a useful drug. Each chemical modification requires additional chemical steps, 
which while being reasonable for the synthesis of a finite number of compounds, 
quickly becomes overwhelming if all possible modifications needed to be synthesized. 
Thus, through the use of the three-dimensional structure disclosed herein and 

1 0 computer modeling, a large number of these compounds can be rapidly screened on 
the computer monitor screen, and a few likely candidates can be determined without 
the laborious synthesis of untold numbers of compounds. 

Once a potential ligand (agonist or antagonist) is identified it can be 
either selected from a library of chemicals as are commercially available from most 

1 5 large chemical companies including Merck, Glaxo Welcome, Bristol Meyers Squib, 
Monsanto/Searle, Eli Lilly, Novartis and Pharmacia Upjohn, or alternatively the 
potential ligand may be synthesized de novo. As mentioned above, the de novo 
synthesis of one or even a relatively small group of specific compounds is reasonable 
in the art of drug design. The prospective drug can be physically tested to confirm its 

2 0 projected activity. For example, if the activity sought for such a potential ligand is its 
ability to prevent the binding of FGF9 to its receptor FGFR3, the potential ligand can 
be placed into any standard binding assay described below to test its effect on the 
FGF9-FGFR3 interaction. A preferred ligand for the purpose of this assay would be 
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one which is capable of binding to FGFR3 with a greater affinity than that of FGF9 for 
FGFR3. If the assay is conducted with FGFR3 on the surface of living cells, then one 
can determine whether or not the ligand which binds to FGFR3 causes signaling by the 
receptor. If it binds but does not cause signaling, then it is an antagonist If it binds 
5 and causes signaling, then it is an agonist. 

If the activity sought for such a potential ligand is its ability to bind 
directly to FGF9, this activity can be detected by means of a standard binding assay 
whereby the potential ligand may be selected on the basis of its having the capability 
of binding to FGF9. An antagonist may also be a ligand which binds to FGF9 so as to 

1 0 prevent FGF9 from binding to FGFR3 . The ability of the potential antagonist to have 
this activity may also be detected by means of a simple assay for binding to FGF9 in 
the presence of FGFR3, as is well known in the art. 

Other assays which can be conducted for potential ligands relate to the 
effect of heparin on FGF9. Potential ligands which interact with the heparin binding 

1 5 pockets of FGF9 may have a significant effect on the activity of FGF9, such as by 
preventing the heparin-dependent oligomerization thereof. Thus, once a potential 
ligand which may affect the heparin binding property of FGF9 is selected by means of 
computer modeling, the ability of the potential ligand to actually interfere with such 
binding may be determined in a standard binding assay to test its effect on the FGF9- 

2 0 heparin interaction. 

When a suitable drug is identified, a supplemental crystal can be grown 
which comprises a protein-ligand complex formed between the FGF9 protein and the 
drug. Preferably the crystal effectively diffracts X-rays allowing the determination of 
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the atomic coordinates of the protein-ligand complex to a resolution of greater than 5.0 
As, more preferably greater than 3 .0 A, and even more preferably greater than 2.0 A. 
The three-dimensional structure of the supplemental crystal can be determined by 
Molecular Replacement Analysis. Molecular replacement involves using a known 
5 three-dimensional structure as a search model to determine the structure of a closely 
related molecule or protein-ligand complex in a new crystal form. The measured X- 
ray diffraction properties of the new crystal are compared with the search model 
structure to compute the position and orientation of the protein in the new crystal. 
Computer programs that can be used include: X-PLOR and AMORE (Navaza, 1994). 

1 0 Once the position and orientation are known an electron density map can be calculated 
using the search model to provide X-ray phases. Thereafter, the electron density is 
inspected for structural differences and the search model is modified to conform to the 
new structure. Other computer programs that can be used to solve the structures of 
such crystals include QUANTA, CHARMM, INSIGHT, SYBYL, MACROMODE, 

15 andlCM. 

For all of the drug screening assays described herein further 
refinements to the structure of the drug will generally be necessary and can be made by 
the successive iterations of any and/or all of the steps provided by the particular drug 
screening assay. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a ribbon representation of the FGF9 dimer composed of 
chains A and D showing the carbohydrate moiety bound to each of the chains. 

11 
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Figure 3 shows the hydrogen bond network stabilizing the 08/09 loop. 
Molecule D is represented with the light colored chain on the left side of the figure, 
molecule A with darker chain trace on the right side of the figure. 

Figure 4 shows a diagram of a system used to carry out the instructions 
5 encoded by the storage medium of Figure 4 A and 4B. 

Figure 5 A shows a cross-section of a magnetic storage medium. 

Figure 5B shows a cross-section of an optically-readable data storage 

medium. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

10 Overall Structure Description 

The core unit of the FGF9 structure (Fig. 1) is formed by residues 62 to 
193 and is very similar to the structures of FGF1 and FGF2, as expected from the 
sequence alignment (Plotnikoy et al. 1999). Further features of the schematic 
representation of the FGF9 dimer are expanded upon in the inventors' recent 

1 5 publication (Hecht et al. 2001 Acta Cryst. D57). The rms difference to Ca-atoms of 
FGF1 (Blaber et al, 1996, and pdb-id 2afg) and FGF2 (Zhang et al, 1991, and pdb-id 
2FGF) is 0.8 A. Major differences (rmsd > 1.5 A) occur at Thr70/Gly71, where FGF1 
and FGF2 have an additional glycine, at the loop Asp88/Ser90, which may be 
correlated with the C-terminal extension in FGF9, and at Tyrl53/Argl61, where in 

2 0 FGF9 an insertion of three (relative to FGF1) or five (relative to FGF2) residues 

occurs. The loop Glul41/Asnl46 shows some variability already in FGF1 and FGF2. 
Compared to FGF1 the largest difference in Ca-positions in this loop is 3.9 A (at 
Alal42) while the largest difference is 1.9 A (at Alal42) compared to FGF2. The loop 
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containing in FGF9 the glycosylation site at Asn79 is identical to that of FGF1 and 
FGF2. N-terminal sequencing and Maldi-mass-spectrometry indicated heterogeneity 
of the crystallized protein with the major components starting at residues 19, 34, 38, 
and 42. In the structure, residues become visible in one of the molecules at residue 45 
5 with three flexible residues in an extended conformation turning into a helix between 
residues 48-62. In the other three molecules of the asymmetric unit the helical part is 
visible only from residue 52 onward. The C-terminal residues starting from residue 
193 form an irregular helix which shows some variability in the four molecules of the 
asymmetric unit. Together these N- and C4erminal parts form an extension clearly 

1 0 separate from the core structure. 
Quaternary Structure 

There is increasing evidence for the capacity of FGFs to undergo either 
spontaneous or heparin induced oligomerization, although the relation of such dimers 
and higher order oligomers to receptor binding and activation is still unclear. For 

1 5 FGF1 a heparin-linked dimeric structure has been reported (DiGabriele et al, 1998) 
while for FGF2 in the presence of heparin both monomeric and dimeric structures 
were observed (Faham et al, 1996). Moreover, chemical cross linking, 
ultracentrifugation experiments (Herr et al, 1997) and mass spectrometry techniques 
(Davis et al, 1999) provided evidence of self-oligomerization for FGF2 in the presence 

2 0 and in the absence of heparin. Nevertheless, in the structures of the FGF2 receptor 
complex (Plotnikov et al, 1999) and the FGF1 receptor complex (Stauber et al, 2000) 
both FGF molecules are separate and only linked via the receptor molecules. In these 
structures heparin is postulated to bind into a positively charged groove, created in the 

13 
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receptor dimer with the two termini bound to the heparin-binding domains of the 
FGF2 molecules (Plotnikov et al, 1999, Stauber et al, 2000). 

FGF9 readily dimerizes under physiological conditions, probably more 
easily than other FGFs, and dimers of FGF9 are frequently observed by 
5 immunoblotting lysates of RC J3 . 1 C5 . 1 8 mesenchymal cells (Garofalo et al, 1 999) and 
L-8. Accordingly, the FGF9 structure, crystallized in the absence of heparin, shows 
the four molecules of the asymmetric unit organized in two dimers related by non- 
crystallographic symmetry. The solvent accessible surface area, calculated with Grasp 

(Nicholls et al, 1 991), varies between 8848 A 2 and 9306 A 2 for the individual 
1 0 molecules, depending on the length of the extensions. The surface areas of the dimeric 

molecules, chains AD and BC, are 15826 A 2 and 15481 A 2 , yielding buried surface . 
areas of 2422 A 2 and 2420 A 2 or approximately 1200 A 2 per chain, well above the 
cutoff value of 400 A 2 per chain used as one of the classification criteria by the 
Protein Quaternary Structure server PQS (http://pqs.ebi.ac.uk/pqs-doc/pqs-doc.shtml). 

1 5 More than half of this buried surface of the dimer is contributed by the N- and C- 
terminal extensions, as the buried area per dimer is reduced to 1040 A 2 and 853 A 2 
when only the residues 62-193 of the FGF-core structure are used in the calculation. 
The lack of these structured terminal extensions therefore can be one of the reasons 
why similar dimer formation has not been observed, in the absence of heparin, in the 

2 0 FGF1 and FGF2 structures. For FGF2 the crystal structure with the highest resolution 
(pdb-id Ibgf) showed disorder for the N-tenninal first 19 to 20 residues (Ago et al, 
1991), confirmed by NMR studies of complete FGF2 (Lozano et al, 1998; pdb-id 
lrml) which showed disorder for the N-terminal 28 residues (Moy et al 1996). For 
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FGF1 the crystal structure with the highest resolution (Blaber et al, 1996) showed 
disorder for the N-terminal 9-10 residues (Blaber et al, 1996; pdb-id 2afg) and for the 
NMR structure a N-tenninally at residue 25 truncated molecule was used (Lozano et 
al, 1998; pdb-id lrml). 
5 Dimer Interface 

The dimer interface in FGF9 consists mainly of hydrophobic contacts 
but includes 4 hydrogen bonds and two salt bridges, related by non-crystallographic 
two-fold symmetry. The hydrogen bonds connect the side chain of Y67 with the side 
chain of N143 and the side chain of R64 with the backbone carbonyl of V192 where 

10 the C-terminal extension starts, while the salt bridges connect R62 with D193, also at 
the start of the C-terminal extension. The hydrophobic contacts are concentrated in a 
prominent hydrophobic cluster of the residues L54, L57, 160, L61, P194, V197 and 
L200 at the base of the structure, close to where the terminal extensions join the core. 
At the center and top of the core structure PI 91, LI 88 together with PI 89 and the 

1 5 hydrophobic parts of the side chains of Rl 90, Wl 44 and Yl 45 form an additional, 
though less pronounced hydrophobic patch. A potentially important structural 
difference between FGF9 and FGF1 and FGF2 occurs in the dimer interface with the 
noticeable shift of the p-turn linking P8 and P9 (residues 139-146, corresponding to 
96-104 in FGF2). In FGF9, the loop conformation is fixed by a hydrogen bond 

20 network involving residues H181, H186, E141 and E142 (Fig. 3). The arrangement is 
stabilized further by a salt bridge between El 42 and R69. Residues from this loop 
have been implicated in receptor binding (Venkataraman et al, 1999) and in the 
experimental FGF receptor complexes, where residues from this loop make extensive 

15 
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contacts to the receptor, the loop has been found to undergo some conformational 
change upon receptor binding (Plotnikov et al, 1999; Stauber et al, 2000). This 
conformational adaptation is likely to be much reduced in FGF9 due to the hydrogen 
bonding network. Stabilization of this loop in a particular conformation by residues 
5 not directly involved in receptor binding, as in FGF9, therefore could have significant 
implications on receptor affinity. In the structure of FGF7 (Ye et al, 1999, pdb-id 
1QQK), where E142 and R69 are conserved, the loop is in a conformation similar to 
FGF1 and FGF2 but lacks the salt bridge. Most likely the loop conformation in FGF9 
is influenced by the hydrogen bond between El 41 and HI 81 , which is unique to FGF9 

10 and FGF16. Similar interactions could occur in FGF5, which has two glutamines in 
these places, and in FGF10, which has glutamic acid and lysine. 

With the exception of residues from the terminal extensions most of the 
residues (Fig. 1) involved in the dimer interface in FGF9 correspond to residues 
identified as belonging to the major receptor binding sites in FGF2 (Venkataraman et 

15 al, 1999; Plotnikov et al, 1999; Stauber et al, 2000; Plotnikov et al, 2000). This is 

particularly true for residues Y67, Y145, LI 88, 160 and HI 86, corresponding in FGF2 
to Y24, Y103, L140, F17 and L138 and in FGF1 to residues Y15, Y94, L133, Y8 and 
L 1 3 1 which were found by Plotnikov et al, 1 999, and Stauber et al, 2000, to be in 
contact with the receptor. These residues are almost completely buried (less than 1 0 

20 A 2 solvent accessible surface, calculated with Areaimol (CCP4, 1 994)) in the FGF9 
dimer interface and, in order for them to become accessible to the receptor molecule, 
dissociation of these pre-formed dimers has to occur at least in FGF9. 
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In the experimental FGF receptor complexes (Plotnikov et al, 1999; 
Stauber et al, 2000) both the FGF ligand molecules are separate and linked only by 
heparin via the receptor molecules. A complete separation of the FGF9 dimer requires 
the separation of the extensive hydrophobic interactions at the N- and C-terminal 
5 extensions. As it seems unlikely that these hydrophobic residues remain exposed to 
solvent, at least three alternative scenarios can be proposed. 

1 . At present there is no experimental evidence that residues outside of 
the core-FGF structure participate in receptor binding although in the FGF1/FGFR2 
complex both FGF termini are in the vicinity of the receptor. In addition, preliminary 

1 0 results suggest that a complete deletion of both termini may have no apparent 

functional implications as evidenced by the capacity of a truncated form to both bind 
receptor and induce cell proliferation (Adar et al, in preparation). The function of 
these terminal residues could be therefore to provide stability to the unliganded FGF- 
molecules, probably correlated with the function as a non-cleaved secretion signal 

1 5 attributed to the 60 N-terminal residues by Revest et al (1 999), but become redundant 
and flexible at receptor binding. It is intriguing to suggest that the observed heparin- 
independent self association of FGF9 could have physiological significance by 
rendering the non-receptor bound FGF in a protected, non active form by utilizing the 
same residues defined for receptor binding for a homotypic dimer interface. 

20 2. These residues remain as a connecting region between the FGF 

molecules after a conformational change that exposes the buried receptor binding 
residues. Preliminary modeling suggests that this could be possible with hinge regions 
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probably in the area of residues 62 and 1 90-192. In this case the terminal extensions 
could connect adjacent ligand/receptor complexes to form multimeric assemblies. 

3. In the experimental receptor/ligand complexes (Plotnikov et al, 
1 999, Stauber et al, 2000) the secondary receptor binding sites are different from the 
5 sites identified by site directed mutagenesis as influencing receptor binding (Springer 
et al, 1994, Zhu et al, 1997, Zhu et al, 1998). This discrepancy presently is not clear 
and may point to the involvement of other determinants in FGF in receptor binding 
and activation. 

At least some of the FGFs, especially FGF3 and FGF1 6, show in the 
1 0 sequence alignment a similar pattern of hydrophobic and hydrophilic residues in these 
terminal extensions. However, due to the high sequence diversity and the structural 
flexibility, still more structural investigations of these homologs is yet required. 
Potential Heparin Binding Sites 

Heparin binding sites have been structurally identified in the heparin 
1 5 linked FGF1 dimer (DiGabriele et al, 1998; pdb-id 2axm) and in heparin complexes 
with FGF2 monomers (Faham et al, 1996; pdb-id lbfb) where prominently 
interactions of basic residues with the sugars, sulphate or carboxylate groups are 
involved. The surface of FGF9 contains three clusters of basic residues potentially 
suitable for heparin binding. At least one of these sites contains a bound sulphate 
2 0 molecule while in the other cases the discrimination between bound water and 
sulphate is less certain due to the limited resolution. The first site is in a pocket 
created by the insertion at Tyrl53/Argl61 and the sulphate ion is bound to R180, 
Yl 63 and the backbone nitrogen of Rl 61 . This pocket is at approximately 1 4 A 

18 
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distance from the nearest heparin binding site in FGF1 and FGF2 but could occur also 
in FGF16, FGF13 and FGF1 1 which have a highly homologous insertion and identical 
or homologous residues in position 163 and 180. The second site, where R137, K154 
and R161 form a cluster highly suggestive of sulphate binding, is even further away 
5 from the FGF1 and FGF2 homologous sites and is located almost on the opposite side 
of the molecule. A similar arrangement could occur in FGF16 as well, where R161 is 
replaced by a glutamine. The third site is formed by R173 and R177 which 
correspond to Kl 1 8 and R122 in the heparin binding loop in FGF1 (DiGabriele et al, 
1998; pdb-id 2axm) and to K125 and K129 in FGF2 (Faham et al, 1996; pdb-id lbfb). 

1 0 Fitting the heparin structures observed in FGF1 (DiGabriele et al, 1 998) and FGF2 
(Faham et al, 1 996) to FGF9, however, shows that the high affinity heparin binding 
site described by the residues N28, K126 and Q135 in FGF2 (Faham et al, 1996) is 
partially blocked in FGF9 by the side chain of Fl 84 which makes the backbone 
nitrogen atoms less accessible for sulphate binding as observed for FGF2 and FGF1 . 

15 In the experimental FGF2/FGFR1 complex, this site contains a bound sulphate ion and 
is proposed to bind the terminal part of heparin (Plotnikov et al, 1999). Sulphate ions 
visible in the experimental FGF1/FGFR2 complex (Stauber et al, 2000; pdb-id 1DJS), 
however, seem to correspond well with the potential heparin binding sites on FGF9. 
In this complex three sulphate ions are bound to FGF1, K128, Kl 1 8, and R122, 

20 probably with contributions by Kl 12 and Rl 19. In FGF9K1 83 corresponds to FGF1 
K128 and, in addition, R69 is directed very close to the sulphate bound to FGF1 K128. 
FGF9 Rl 73 and Rl 77 correspond to FGF1 Kl 1 8 and R122 and only a small 
adjustment due to Fl 84 would be necessary for similar sulphate binding to the 

19 
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complex. These fine adjustments in the spatial organization of the heparin binding 
residues in FGF9 may well coordinate with the distinct structural variants of sulfated 
domains on heparin sulfates, required for binding and activation of different members 
of the FGF family as well as of other heparin binding growth factors (Ornitz, 2000). 
5 Computer Representation 

The FGF9 X-ray coordinate data, when used in conjunction with a 
computer programmed with software to translate those coordinates into the 3- 
dimensional structure of FGF9 may be used for a variety of purposes, especially for 
purposes relating to drug discovery. Such software for generating 3 -dimensional 

1 0 graphical representations are known and commercially available. The ready use of the 
coordinate data requires that it be stored in a computer-readable format Thus, in 
accordance with the present invention, data capable of being displayed as the 3- 
dimensional structure of FGF9 and portions thereof and their structurally similar 
homologs is stored in a machine-readable storage medium, which is capable of 

1 5 displaying a graphical 3-dimenslonal representation of the structure. 

Therefore, another embodiment of this invention provides a machine- 
readable data storage medium, comprising a data storage material encoded with 
machine-readable data which, when used by a machine programmed with instructions 
for using said data, displays a graphical 3-dimensional representation of a molecule or 

2 0 molecular complex comprising FGF9, or a homolog of said molecule or molecular 
complex, wherein said homolog comprises a binding pocket that has a root mean 
square deviation from the backbone atoms of said amino acids of not more than about 
1.1 5A. 
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Even more preferred is a machine-readable data storage medium that is 
capable of displaying a graphical 3-dimensional representation of a molecule or 
molecular complex that is defined by the structure coordinates of all of the amino 
acids in Figure 2 or a homolog of said molecule or molecular complex, wherein said 
5 homolog has a root mean square deviation from the backbone atoms of all of the 
amino acids in Figure 2 of not more than about 1.15A. 

According to an alternate embodiment, the machine-readable data 
storage medium comprises a data storage material encoded with a first set of machine- 
readable data which comprises the Fourier transform of the structure coordinates set 

1 0 forth in Figure 2, and which, when using a machine programmed with instructions for 
using said data, can be combined with a second set of machine-readable data 
comprising the X-ray diffraction pattern of another molecule or molecular complex to 
determine at least a portion of the structure coordinates corresponding to the second 
set of machine-readable data. 

1 5 For example, the Fourier transform of the structure coordinates set forth 

in Figure 2 may be used to determine at least a portion of the structure coordinates of 
FGF9. 

According to an alternate embodiment, this invention provides a 
computer for producing a 3-dimensional representation of a molecule or molecular 
2 0 complex, wherein said molecule or molecular complex comprises all of the FGF9 
amino acids, wherein said computer comprises: 
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(a) a machine-readable data storage medium comprising a data storage 
material encoded with machine-readable data, wherein said machine-readable data 
comprises the structure coordinates of FGF9 or portions thereof; 

(b) a working memory for storing instructions for processing said 
5 machine-readable data; 

(c) a central-processing unit coupled to said working memory and to 
said machine-readable data storage medium, for processing said machine-readable 
data into said 3-dimensional representation; and 

(d) an output hardware coupled to said central processing unit, for 
1 0 receiving said 3-dimensional representation. 

Figure 4 demonstrates one version of these embodiments. System 1 0 
includes a computer 1 1 comprising a central processing unit ("CPU") 20, a working 
memory 22 which may be, e.g., RAM (random-access memory) or "core" memory, 
mass storage memory 24 (such as one or more disk drives or CD-ROM drives), one or 

1 5 more cathode-ray tube ("CRT") display terminals 26, one or more keyboards 28, one 
or more input lines 30, and one or more output lines 40, all of which are 
interconnected by a conventional bidirectional system bus 50. 

Input hardware 36, coupled to computer 1 1 by input lines 30, may be 
implemented in a variety of ways. Machine-readable data of this invention may be 

2 0 inputted via the use of a modem or modems 32 connected by a telephone line or 
dedicated data line 34. Alternatively or additionally, the input hardware 36 may 
comprise CD-ROM drives or disk drives 24. In conjunction with display terminal 26, 
keyboard 28 may also be used as an input device. 
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Output hardware 46, coupled to computer 1 1 by output lines 40, may 
similarly be implemented by conventional devices. By way of example, output 
hardware 46 may include CRT display terminal 26 for displaying a graphical 
representation of a binding pocket of this invention using a program such as 
5 QUANTA as described herein. Output hardware might also include a printer 42, so 
that hard copy output may be produced, or a disk drive 24, to store system output for 
later use. 

In operation, CPU 20 coordinates the use of the various input and 
output devices 36, 46 coordinates data accesses from mass storage 24 and accesses to 

1 0 and from working memory 22, and determines the sequence of data processing steps. 
A number of programs may be used to process the machine-readable data of this 
invention. Such programs are discussed in reference to the computational methods of 
drug discovery as described herein. Specific references to components of the 
hardware system 10 are included as appropriate throughout the following description 

15 of the data storage medium. 

Figure 5 A shows a cross section of a magnetic data storage medium 
1 00 which can be encoded with a machine-readable data that can be carried out by a 
system such as system 1 0 of Figure 4. Medium 1 00 can be a conventional floppy 
diskette or hard disk, having a suitable substrate 101, which may be conventional, and 

20 a suitable coating 102, which may be conventional, on one or both sides, containing 
magnetic domains (not visible) whose polarity or orientation can be altered 
magnetically. Medium 1 00 may also have an opening (not shown) for receiving the 
spindle of a disk drive or other data storage device 24. The magnetic domains of 
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coating 1 02 of medium 100 are polarized or oriented so as to encode in manner which 
may be conventional, machine-readable data such as that described herein, for 
execution by a system such as system 10 of Figure 4. 

Figure 5B shows a cross-section of an optically-readable data storage 
5 medium 1 1 0 which also can be encoded with such a machine-readable data, or set of 
instructions, which can be carried out by a system such as system 10 of Figure 4. 
Medium 1 10 can be a conventional compact disk read only memory (CD-ROM) or a 
rewritable medium such as a magneto-optical disk that is optically readable and 
magneto-optically writable. Medium 100 preferably has a suitable substrate 111, 

1 0 which may be conventional, and a suitable coating 1 12, which may be conventional, 
usually of one side of substrate 111. 

In the case of CD-ROM, as is well known, coating 1 12 is reflective and 
is impressed with a plurality of pits 1 13 to encode the machine-readable data. The 
arrangement of pits is read by reflecting laser light off the surface of coating 1 12. A 

15 protective coating 1 14, which preferably is substantially transparent, is provided on 
top of coating 112. 

In the case of a magneto-optical disk, as is well known, coating 1 12 has 
no pits 1 1 3, but has a plurality of magnetic domains whose polarity or orientation can 
be changed magnetically when heated above a certain temperature, as by a laser (not 

2 0 shown). The orientation of the domains can be read by measuring the polarization of 
laser light reflected from coating 1 12. The arrangement of the domains encodes the 
data as described above. 
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As mentioned above, the FGF9 X-ray coordinate data is useful for 
screening and identifying drugs that inhibit FGF9. For example, the structure encoded 
by the data may be computationally evaluated for its ability to associate with putative 
substrates or ligands. Such compounds that associate with FGF9 may inhibit FGF9, 
5 and are potential drug candidates. Additionally or alternatively, the structure encoded 
by the data may be displayed in a graphical 3-dimensional representation on a 
computer screen. This allows visual inspection of the structure, as well as visual 
inspection of the structure's association with the compounds. 

Thus according to another embodiment, the method evaluates the 
1 0 potential of a chemical entity to associate with a molecule or molecular complex 
defined by the structure coordinates of all of the FGF9 amino acids, as set forth in 
Figure 2, or a homolog of said molecule or molecular complex having a root mean 
square deviation from the backbone atoms of said amino acids of not more than 1.1 A. 
This method comprises the steps of: 
15 a) creating a computer model of the molecular or molecular complex 

using the structure coordinates as set forth in Figure 2, or a homolog of said molecule 
or molecular complex having a root mean square deviation from the backbone atoms 
of said amino acids not more than about 1.1 5 A; 

b) employing computational means to perform a fitting operation 
2 0 between the chemical entity and said computer model of the binding pocket; and 

c) analyzing the results of said fitting operation to quantify the 
association between the chemical entity and the binding pocket model. 
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The term "chemical entity", as used herein, refers to chemical 
compounds or ligands, complexes of at least two chemical compounds, and fragments 
of such compounds or complexes. 

More preferred is the use of the atomic coordinates of all the amino 
5 acids of FGF9 according to Figure 2 ± a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1.15A, to generate a 3- 
dimensional structure of FGF9. 

For the first time, the present invention permits the use of molecular 
design techniques to identify, select or design potential inhibitors of FGF9, based on 
1 0 the structure of thereof. Such a predictive model is valuable in light of tiie high costs 
associated with the preparation and testing of the many diverse compounds that may 
possibly bind to the FGF9 protein. 

According to this invention, a potential FGF9 inhibitor may now be 
evaluated for its ability to bind a FGF9-like binding pocket prior to its actual synthesis 
1 5 and testing. If a proposed compound is predicted to have insufficient interaction or 
association with the binding pocket, preparation and testing of the compound is 
obviated. However, if the computer modeling indicates a strong interaction, the 
compound may then be obtained and tested for its ability to bind. 

A potential inhibitor of a FGF9-like binding pocket may be 
2 0 computationally evaluated by means of a series of steps in which chemical entities or 
fragments are screened and selected for their ability to associate with the FGF9-like 
binding pockets. 
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One skilled in the art may use one of several methods to screen 
chemical entities or fragments for their ability to associate with a FGF9-like binding 
pocket. This process may begin by visual inspection of, for example, a FGF9-like 
binding pocket on the computer screen based on the FGF9 structure coordinates in 
5 Figure 2 or other coordinates which define a similar shape generated from the 

machine-readable storage medium. Selected fragments or chemical entities may then 
be positioned in a variety of orientations, or docked, within that binding pocket as 
defined above. Docking may be accomplished using software such as Quanta and 
Sybyl, followed by energy minimization and molecular dynamics with standard 
1 0 molecular mechanics force fields, such as CHARMM and AMBER. 

Specialized computer programs may also assist in the process of 
selecting fragments or chemical entities. These include: 

1 . GRID (Goodford, 1985), which is available from Oxford University, 

Oxford, UK. 

15 2. MCSS (Miranker et al, 1991), which is available from Molecular 

Simulations, San Diego, CA. 

3. AUTODOCK (GoodseU et al, 1990), which is available from 
Scripps Research Institute, La Jolla, CA. 

4. DOCK (Kuntz et al, 1982), which is available from University of 
2 0 California, San Francisco, CA, 

Once suitable chemical entities or fragments have been selected, they 
can be designed or assembled into a single compound or complex. Assembly may be 
preceded by visual inspection of the relationship of the fragments to each other on the 
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3-dimensional image displayed on a computer screen in relation to the structure 
coordinates of FGF9. This would be followed by manual model building using 
software such as Quanta or Sybyl (Tripos Associates, St. Louis, MO). Useful 
programs to aid one of skill in the art in connecting the individual chemical entities or 
5 fragments include: 

1. CAVEAT (Bartlett et al, 1989; Lauri et al, 1994), which is available 
from the University of California, Berkeley, CA. 

2. 3D Database systems, such as ISIS (MDL Information Systems, San 
Leandro, CA). This area is reviewed in Martin, 1992. 

10 3 . HOOK (Eisen et al, 1 994), which is available from Molecular 

Simulations, San Diego, CA. 

Instead of proceeding to build an inhibitor of a FGF9-like binding 

pocket in a step-wise fashion one fragment or chemical entity at a time as described 

above, inhibitory or other FGF9 binding compounds may be designed as a whole or 
1 5 "de novo 1 ' using either an empty binding site or optionally including some portion(s) of 

a known inhibitor(s). There are many de novo ligand design methods including: 

1 . LUDI (Bohm, 1 992), which is available from Molecular Simulations 
Incorporated, San Diego, CA. 

2. LEGEND (Nishibata et al, 1991), which is available from Molecular 
20 Simulations Incorporated, San Diego, CA. 

3. LeapFrog (available from Tripos Associates, St. Louis, MO). 

4. SPROUT (Gillet et al, 1993), which is available from the University 

of Leeds, UK. 
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Other molecular modeling techniques may also be employed in accordance with this 
invention (see, e.g., Cohen et al, 1990; Navia et al, 1992; Balbes et al, 1994; Guida, 
1994). 

Once a compound has been designed or selected by the above methods, 
5 the efficiency with which that entity may bind to a FGF9 binding pocket may be tested 
and optimized by computational evaluation. For example, an effective FGF9 binding 
pocket inhibitor must preferably demonstrate a relatively small difference in energy 
between its bound and free states (i.e., a small deformation energy of binding). Thus, 
the most efficient FGF9 binding pocket inhibitors should preferably be designed with 

10 a deformation energy of binding of not greater than about 1 0 kcal/mole, more 

preferably, not greater than 7 kcal/mole. FGF9 binding pocket inhibitors may interact 
with the binding pocket in more than one of multiple conformations that are similar in 
overall binding energy. In those cases, the deformation energy of binding is taken to 
be the difference between the energy of the free entity and the average energy of the 

1 5 conformations observed when the inhibitor binds to the protein. 

An entity designed or selected as binding to a FGF9 binding pocket 
may be further computationally optimized so that in its bound state it would preferably 
lack repulsive electrostatic interaction with the target enzyme and with the surrounding 
water molecules. Such non-complementary electrostatic interactions include repulsive 

2 0 charge-charge, dipole-dipole and charge-dipole interactions. 

Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interactions. Examples of programs designed for 
such uses include: Gaussian 99, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, 
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PA, ©1995); AMBER, version 4.1 (P. A. Kollman, University of California at San 
Francisco, ©1995); QUANTA/CHARMM (Molecular Simulations, Inc., San Diego, 
CA ©1995); Insight H/Discover (Molecular Simulations, Inc., San Diego, GA ©1995); 
DelPhi (Molecular Simulations, Inc., San Diego, CA ©1995); and AMSOL (Quantum 
5 Chemistry Program Exchange, Indiana University). These programs may be 

implemented, for instance, using a Silicon Graphics workstation such as an Indigo 2 
with "IMPACT" graphics. Other hardware systems and software packages will be 
known to those skilled in the art. 

Another approach enabled by this invention, is the computational 

1 0 screening of small molecule databases for chemical entities or compounds that can 

bind in whole, or in part, to a FGF9 binding pocket. In this screening, the quality of fit 
of such entities to the binding site may be judged either by shape complementarily or 
by estimated interaction energy (Meng et al, 1992). 

According to another embodiment, the invention provides compounds 

1 5 that associate with a FGF9-like binding pocket produced or identified by the method 
set forth above. 

The structure coordinates set forth in Figure 2 can also be used to aid in 
obtaining structural information about another crystallized molecule or molecular 
complex. This may be achieved by any of a number of well-known techniques, 
2 0 including molecular replacement. 

In order that this invention be more fully understood, the following 
example is set forth. This example is for the purpose of illustration only and is not to 
be construed as limiting the scope of the invention in any way. 
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EXAMPLE: 

Materials and Methods 

The full-length coding region for human FGF9 (Miyamoto et al, 1 993) 
5 cDNA was isolated as a BamHI/blunt fragment from pET vector (Kuriyama et al, 
1995) and subcloned into the vector pBacPAK9 digested with Bgffi and Smal. 
Plasmids containing the cDNA species in proper orientation were isolated from 
bacteria, used for transfection into Sf9 cells with purified linearized baculovirus DNA. 
Screening for recombinant viruses, cloning and propagation or rec. viruses were 

1 0 performed as described (Fiebich et al, 1993). For purification of FGF9 protein from 
the insect cell serum-free supernatant, it was adjusted to 0.6 M NaCl and purified over 
a 5 ml HiTrap heparin column (Pharmacia Amersham). FGF9 containing samples 
were pooled, diluted 1 :3 with 20 mM Tris/Cl pH 7.4 and applied to a 5-ml TSK- 
heparin-affinity FPLC column (TosoHaas). Bound proteins were eluted with a 20ml 

15 gradient of 0.4-1.5 MNaCl in buffer A (20 mM Tris/Cl, pH 7.4). Aiiquots of 1 ml 
fraction containing FGF9 were used for SDS/PAGE and for silver staining of the gel. 

The protein concentration was measured with a standard assay (BCA, 
Pierce). For amino-terminal sequencing of glycosylated rh FGF9, 20 mg protein from 
the biological active fractions (estimated with BALBc-3T3 cells, not shown) were 

2 0 loaded onto a Applied Biosystems 473 A gas-phase protein sequenator. Twenty 

rounds of Edman degradation were carried out using standard protocols and chemicals 
supplied by Applied Biosystems (ca. 50% pos. 19 and 50% pos. 34 of the coding 
region). 
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Crystals were grown with the sitting drop method to a typical size of 
0.2x0.2x0.2 mm from solutions containing FGF9 at a concentration of 2.1 mg/ml and 
2.0 M ammonium sulphate, buffered at pH 5.2 with 0. 1 M MES/Tris buffer. The 
statistics of the native data set, collected at the MPG-GBF beamline BW6 of the 
5 DESY synchrotron from a shock-cooled crystal to a resolution of 2.6 A, are given in 
Table 1. Indexing and scaling the data set with Mosflm (CCP4, 1994) and Scala 
(CCP4, 1994) proved the space group to be tetragonal 14 j with lattice constants 

a=151.9 A, c=117.2 A. The asymmetric unit contains four molecules showing clear 
two-fold symmetry in a pseudo-14122 arrangement and in addition a pseudo-cubic 

1 0 three-fold axis in the self-rotation function calculated with Glrf (Tong et al, 1 997). 
The structure was solved by molecular replacement. The successful run of EPMR 
(Kissinger et al, 1999) used the coordinates of FGF1 (Blaber et al, 1 996; pdb-id 2afg), 
modified by replacement of all non-glycine residues by alanine, and identified clearly 
three of the four molecules in the asymmetric unit with a correlation factor of 0.296. 

15 The fourth molecule was placed manually by complementing the third molecule to a 
dimer identical to the first two molecules. The structure was refined using CNS 
(Brunger et al, 1998), Refinac (CCP4, 1994), and O (Jones et al, 1991). Water 
molecules were added using Arpp/Refmac (CCP4, 1994) until the decrease of the free 
R-factor stopped. In the last stages of the refinement positional restraints for the non- 

2 0 crystallographic symmetry were dropped, but, due to the limited resolution, only 
grouped temperature factors for main chain and side chain atoms were refined. 
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TABLE 1 
Data Set and Refine Statistics 

Space Group I4 X 
Unit Cell Parameters 

a (A) 151.9 

c(A) 117.2 

Resolution Range (A) 39.5-2.6 

Unique Reflections 40985 

Completeness (%) 99.9 (99.9) 

I/sigma(I) 9.5 (3.4) 
Rail (%) . 22.0 

R free (%) 25.0 

Resolution 40-2.6 

nr. residues 624 

nr. sugars 10 

nr. sulfates 8 

nr. waters 141 

Coord. Error * 0.18 

Core Region (%)$ 90.0 

ncs-rms(A)t 0.58 



values in parenthesis are for the highest resolution shell 2.74 A-2.6 A 

5 

R merge = (Zli,(hkl) - <I(hkl)>) / (£li,(hkl), 
Rail = (3Fo(hkl) - F c (hkl)D / (2F 0 (hkl) 

* calculated with SIGMAA (CCP4) and J calculated with PROCHECK (CCPR) 

10 

f rms deviation of Ca protein atoms related by non-crystallographic symmetry 
calculated with LSQMAN (Kleywegt et al, 1 997) 

N-terminal sequencing and Maldi-mass-spectrometry indicated 
1 5 heterogeneity of the crystallized protein with the major components starting at residues 
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19, 34, 38, and 42 (Swissprot id FGF9_HUMAN). The glycoconjugate is, according 
to Maldi-mass-spectrometry, of the three-mannosyl insect type with 2 N- 
acetylglucosamines, 3 mannose and one fucose moiety, a minor component having 
two fucose molecules, as expected from the expression system. The structure shows 
5 clearly in all four molecules at the N79 glycosylation site density for the two N- 

acetylglucosamines together with one fucose molecule, the rest of the carbohydrate is 
disordered. In the crystal all four molecules of the asymmetric unit show flexibility of 
the N-terminal and, to a lesser extent, the C-terminal residues. 

The first residue visible in the electron density is in one molecule 

1 0 Leu45 and in the others Thr52, C-terminal residues are visible up to 208, the native C- 
terrninus, in one molecule, to 206 in two others, and to 204 in the last molecule. The 
average rmsd between all Ca-atoms common to the four molecules in the asymmetric 
unit is 0.6 and 0.3 for the residues 62 to 193. The final refinement statistics for the 
model consisting of 623 amino acid residues, 10 carbohydrate, 141 water and 8 

1 5 sulphate molecules are given in Table 1 . The coordinates are set forth in Figure 2. 

The foregoing description of the specific embodiments will so fully 
reveal the general nature of the invention that others can, by applying current 
knowledge, readily modify and/or adapt for various applications such specific 
embodiments without undue experimentation and without departing from the generic 

2 0 concept, and, therefore, such adaptations and modifications should and are intended to 
be comprehended within the meaning and range of equivalents of the disclosed 
embodiments. It is to be understood that the phraseology or terminology employed 
herein is for the purpose of description and not of limitation. The means, materials, 
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and steps for carrying out various disclosed functions may take a variety of alternative 
forms without departing from the invention. Thus the expressions "means to..." and 
"means for...", or any method step language, as may be found in the specification 
above and/or in the claims below, followed by a functional statement, are intended to 
5 define and cover whatever structural, physical, chemical or electrical element or 

structure, or whatever method step, which may now or in the future exist which carries 
out the recited function, whether or not precisely equivalent to the embodiment or 
embodiments disclosed in the specification above, i.e., other means or steps for 
carrying out the same functions can be used; and it is intended that such expressions 
10 be given their broadest interpretation. 
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WHAT IS CLAIMED IS: 

1 . An FGF9 crystal in a tetragonal space group 14 1 with lattice constants a=l 51 .9 
Aandc=117.2 A. 

2. A crystal in accordance with claim 1 refined to an R-value of about 22% at 2.6 
5 A resolution. 

3 . A composition consisting essentially of FGF9 in crystalline form. 

4. A method of using the crystal of claim 1 in a drug screening assay, comprising: 

(a) selecting a potential ligand by performing rational drug design with the 
three-dimensional structure determined for the crystal, wherein said selecting is 

1 0 performed in conjunction with computer modeling; 

(b) contacting the potential ligand with FGF9; and 

(c) detecting the binding of the potential ligand for FGF9, wherein a 
potential drug is selected on the basis of its having the capability of binding to FGF9. 

5. A method of using the crystal of claim 1 in a drug screening assay, comprising: 
1 5 (a) selecting a potential ligand by performing rational drug design with the 

three-dimensional structure determined for the crystal, wherein said selecting is 
performed in conjunction with computer modeling; 

(b) contacting the potential ligand with FGFR3 ; and 

(c) detecting the binding of the potential ligand to FGFR3, wherein a 

2 0 potential drug is selected on the basis of its having the capability of binding to FGFR3 
with a greater affinity than that of FGF9 for FGFR3. 
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6. A method of using the crystal of claim 1 in a drug screening assay comprising: 
(a) selecting a potential antagonist by performing rational drug design with 

the three-dimensional structure determined for the crystal, wherein said selecting is 
performed in conjunction with computer modeling; 
5 (b) adding the potential antagonist to a mixture of FGF9 and FGFR3 ; and 

(c) detecting the ability of the potential antagonist to prevent binding of 
FGF9 to FGFR3, wherein a potential antagonist that inhibits the binding of FGF9 to 
FGFR3 is selected as a potential drug. 

7. A method in accordance with any one of claims 4-6, wherein said potential 
1 0 ligand or potential antagonist is a mutant or fragment of FGF9. 

8. A crystal in accordance with claim 1 , wherein the four molecules of the 
asymmetric unit are organized in two dimers related by non-ciystallographic symmetry. 

9. A model of FGF9, wherein said model represents a three-dimensional structure 
that substantially conforms to the atomic coordinates of Figure 2. 

15 10. A computer-assisted method of structure based drug design of bioactive 

compounds using the model of claim 9, comprising: 

providing said model in the form of a computer image generated when the 

coordinates listed in Figure 2 are analyzed on a computer using a graphical display 

software program to create an electronic file of said image and visualizing said electronic 
20 files on a computer capable of representing said electronic file as a three-dimensional 

image; 

designing a chemical compound using said computer image; and 
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chemically synthesizing said chemical compound. 

11. A method in accordance with claim 10, wherein said step of designing 
comprises computational screening of one or more databases of chemical compounds in 
which the three-dimensional structure of said compounds are known. 

5 12. A three-dimensional computer image of the three-dimensional structure of 
FGF9. 

13. The image of claim 12, wherein said structure substantially conforms with the 
three-dimensional coordinates listed in Figure 2. 

14. A computer-readable data storage medium comprising a data storage material 
1 0 encoded with computer-readable data, wherein said computer-readable data comprises a 

set of three-dimensional coordinates of FGF9 having a three-dimensional structure that 
substantially conforms to the atomic coordinates of Figure 2, wherein, using a graphical 
display software program, said data creates an electronic file that can be visualized on a 
computer capable of representing said electronic file as a three-dimensional image. 

15 15. A computer for producing a three-dimensional representation of FGF9, 
wherein said computer comprises: 

a computer-readable data storage medium in accordance with claim 14; 
a working memory for storing instructions for processing said computer-readable data; 
a central-processing unit coupled to said working memory and to said computer-readable 
2 0 data storage medium, for processing said computer-readable data into said three- 
dimensional representation; and 
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an output hardware coupled to said central processing unit, for receiving said three- 
dimensional representation. 
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FIGURE 2 

REMARK coordinates from minimization and B-factor refinement 

REMARK refinement resolution: 500,0 - 2.6 A 

REMARK starting r= 0.2186 free_r= 0.2494 

REMARK final p= 0.21 88 free_r= 0.2500 

REMARK rmsd bonds= 0.006201 rmsdangles= 1.30156 

REMARK B rmsd for bonded mainchain atoms= $brms_bond_1 target= &bsig_main 

REMARK B rmsd for angle mainchain atoms* $brms_angl 1 target= &asig jnain 

REMARK target= mlf final wa= 1 .56562 final rweight=$b_niveight 

REMARK md-method= cartesian annealing schedule= slowcool 

REMARK starting temperature= 2000 total md steps= 20 * 50 

REMARK cycles= 2 coordinate steps= 50 B-factor steps= 30 

REMARK sg= 14(1) a= 151.95 b= 151.95 c= 117.23 alpha= 90 beta= 90 gamma= 90 

REMARK topology file 1 : CNS_TOPPAR:protein.top 

REMARK topology file 2 : CNS J"OPPAR:dna-rna.top 

REMARK topology file 3 : CNSJ"OPPAR:water.top 

REMARK topology file 4 : CNS J"OPPAR:ion.top 

REMARK topology file 5 : CNSJTOPPAR.carbohydrate.top 

REMARK parameter file 1 : CNSJ"OPPAR:proteinjrep.param 

REMARK parameter file 2 : CNS J"OPPAR:dna-ma _rep.param 

REMARK parameter file 3 : CNSJTOPPAR:waterj"ep.pararh 

REMARK parameter file 4 : CNS_TOPPAR:ion.param 

REMARK parameter file 5 : CNS_TOPPAR:carbohydrate.param 

REMARK molecular structure file: fgf9_60.mtf 

REMARK input coordinates: fgf_r60.xpdb 

REMARK reflection file= fgf9hj-u.refl 

REMARK ncs= none 

REMARK B-correction resolution: 6.0 - 2.6 

REMARK initial B-factor correction applied to fobs : 

REMARK B11= 3.191 B22= 3.191 B33= -6.383 

REMARK B12= 0.000 B13= 0.000 B23= 0.000 

REMARK B-factor correction applied to coordinate array B: 0.876 

REMARK bulk solvent: density level= 0.360313 elA*3, B-factor= 49.9235 A A 2 

REMARK reflections with |Fobs|/sigma_F < 0.0 rejected 

REMARK reflections with |Fobs| > 10000 * rms(Fobs) rejected 

REMARK theoretical total number of refl. in resol. range: 40991 ( 100.0 % ) 

REMARK number of unobserved reflections (no entry or |F|=0): 16 ( 0.0 % ) 

REMARK number of reflections rejected: D ( 0.0 % ) 

REMARK total number of reflections used: 40975 ( 100.0 % ) 

REMARK number of reflections in working set: 38917 ( 94.9 % ) 

REMARK number of reflections in test set: 2058 ( 5.0 % ) 

CRYST1 151,950 151.950 117.230 90.00 90.00 90.00141 

REMARK FILENAME=7sf/bhf/fgf/cns/fgf9 r60 Lpdb" 

REMARK DATE-.1 7-Jul-OO 12:21:18 created by user bhf 

REMARK VERSION:1.0 
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